Speech Recorder and Translator using Google Cloud Speech-to-Text and Translation

Abstract

The most popular video website, YouTube, has about 2 billion users worldwide who speak and understand different languages. Subtitles are essential for users to get the message from a video. However, not all video owners provide subtitles for their videos, which causes potential audiences to have difficulties in understanding the content. Thus, this study proposed a speech recorder and translator to solve this problem. The general concept of the study was to combine Automatic Speech Recognition (ASR) and translation technologies to recognize video content and translate it into other languages. This paper compared and discussed three ASR technologies: Google Cloud Speech-to-Text, Limecraft Transcriber, and VoxSigma. Finally, the proposed system used Google Cloud Speech-to-Text because it supports more languages than Limecraft Transcriber and VoxSigma. Besides, it is more flexible to use with Google Cloud Translation. The study also consisted of a questionnaire on the crucial features of a speech recorder and translator. A total of 19 university students participated in the questionnaire. Most respondents stated that high translation accuracy is vital for the system. The paper also discussed a related work that compared speech recognition between ordinary voice and impaired voice; it used a mobile application to record acoustic input. Compared to the existing mobile app, this project proposed a web application. It was a new study, especially in terms of development and user experience. The proposed system was developed successfully. The results showed that Google Cloud Speech-to-Text and Translation were reliable for video translation. However, the system could not recognize speech when the background music was too loud. It also had a problem with direct translation, which was challenging. Thus, future research may need a custom trained model. In conclusion, this project contributes a new idea for solving the language barrier on the video watching platform.

Similar resources

The Effects of Speech Rate, Prosodic Features, and Blurred Speech on Iranian EFL Learners' Listening Comprehension

Keywords in English: effect of speech rate on listening comprehension, blurred speech, segmental and suprasegmental features, authentic speech, intelligibility, discrimination, omission, assimilation. Abstract: The rate of listening material in connected speech has always been a nightmare for second-language learners in general, and for Iranian listeners in particular. Despite the common-sense view that speech at a slower rate ... listening comprehension activities...

EUTRANS: a Speech-to-Speech Translator Prototype

EUTRANS system is a telephone speech input translation prototype capable of translating telephone calls from one language to another. It assumes a human to human communication, each one speaking a different language, assisted by a system with translation capabilities. The prototype has been developed as a demonstrator for the European project with the same name. EUTRANS achieves a response ti...

Eutrans: a speech-to-speech translator prototype

EuTrans system is a telephone speech input translation prototype capable of translating telephone calls from one language to another. It assumes a human to human communication, each one speaking a different language, assisted by a system with translation capabilities. The prototype has been developed as a demonstrator for the European project with the same name. EuTrans achieves a response time...

A Framework of Translator From English Speech To Sanskrit Text

Human-computer interaction is defined as the interaction of users (humans) with computers. Speech recognition is an area of computer science that deals with the design of systems that recognize spoken words. A speech recognition system allows ordinary people to speak to the system. Recognizing and understanding a spoken sentence is obviously a knowledge-intensive process, which must take into accoun...

Journal

Journal title: Journal of IT in Asia

Year: 2021

ISSN: 1823-5042

DOI: https://doi.org/10.33736/jita.2815.2021